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We show that given a 3-colorable graph, it is N P-hard to find a 3-coloring with (ff + e) of 
the edges bichromatic. In a related result, we show that given a satisfiable instance of the 2-to-l 
O Label Cover problem, it is N P-hard to find a (|| + e)-satisfying assignment. 
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1 Introduction 



Graph coloring problems differ from many other Constraint Satisfaction Problems (CSPs) in that 
we typically care about the case of perfect completeness, e.g. when the graph under consideration 
is 3-colorable rather than almost 3-colorable. Unfortunately, this means that many of the powerful 
tools which have been developed for proving inapproximability results are no longer applicable. 
Most prominently, Raghavendra's [Rag08] optimal inapproximability results for all CSPs, which are 
conditioned on the unproven Unique Games Conjecture (UGC), only apply to the case of imperfect 
completeness. The UGC states that it is NP-hard to distinguish between nearly satisfiable and 
almost completely unsatisfiable instances of Unique, or 1-to-l, Label Cover. As a result, by starting 
a reduction with the UGC, one has already lost perfect completeness. Thus, any inapproximability 
result for a graph coloring problem must begin with a different unproven assumption, such as 
P / NP or Khot's [Kho02] 2-to-l Conjecture. 

The motivation of this paper is to study both of these assumptions as they relate to the graph 
fc-Coloring problem, specifically in the k = 3 case. In the /c-Coloring problem, the input is a 
fc-colorable graph G, and the task is to find a /c-coloring of the vertices of G which maximizes 
the number of bichromatic edges. This problem has previously gone under the names "Max-fc- 
Colorability" [Pet94] and "Maximum fc-Colorable Subgraph" [GS09]. Graph A;-Coloring, along with 
its many studied variants, is a central problem in Computer Science, and pinning down its exact 
approximability is an important open problem. The main result of our paper is an improved 
inapproximability result for 3-Coloring, predicated only on P / NP: 

Theorem 1.1. For all e > 0, (1, jl + e)-deciding the 3-Coloring problem is NP-hard. 

Here by (c, s)-deciding a CSP we mean the task of determining whether an instance is at least c- 
satisfiable or less than s-satisfiable. In fact, this is the best known hardness result for the 3-Coloring 
problem, even assuming conjectures such as the 2-to-l Conjecture. The previous best NP-hardness 
for 3-Coloring was due to Guruswami and Sinop [GS09], who showed a factor ||-hardness via 
a somewhat involved gadget reduction from the 3-query adaptive PCP result of [GLST98]. In 
contrast, the best current algorithm achieves an approximation ratio of 0.836 (and does not need 
the instance to be satisfiable) [GW04]. As for larger values of k, [GS09] construct a reduction which 
directly translates hardness results for 3-Coloring into hardness results for /c-Coloring, for k > 3. 
Applying this to our Theorem 1.1 yields 

Theorem 1.2. For all k > 3 and e > 0, it is HP-hard to (1, 1 — i7(fc + c fc )+ Cfc +e)-decide the k-Coloring 
problem. Here Ck = k (mod 3). 

This is the best known NP-hardness for /c-Coloring. For sufficiently large k, stronger inapprox- 
imability results are known to follow from the 2-to-l Conjecture: 

2-to-l Conjecture ([Kho02]). For every integer e > 0, there is a label set size q such that it is 
NP-hard to (1, e)-decide the 2-to-l Label Cover problem. 

Guruswami and Sinop [GS09] have shown that the 2-to-l Conjecture implies it is NP-hard to 
(1,1 — \ + 0(^))-decide the /c-Coloring problem. This result would be tight up to the O(-) by 
an algorithm of Frieze and Jerrum [FJ97]. In a prior result, Dinur, Mossel, and Regev [DMR09] 
showed that the 2-to-l Conjecture implies that it is NP-hard to C-color a 4-colorable graph for 
any constant C. (They also showed hardness for 3-colorable graphs via another Unique Games 
variant.) It is therefore clear that settling the 2-to-l Conjecture is important to the study of the 
inapproximability of graph coloring problems. 
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Interestingly, to a certain extent the reverse is also true: it is "folklore" that hardness results for 
graph 3-Coloring immediately imply hardness results for the 2-to-l Label Cover problem with label 
sizes 3 & 6 by a simple "constraint-variable" reduction. Indeed, Theorem 1.1 by itself would give 
the best-known NP-hardness for 2-to-l Label Cover. However, we are able to get an even better 
hardness result than this by studying a CSP closely related to 3-Coloring. Our hardness result is: 

Theorem 1.3. For all e > 0, (1, |§ + e)-deciding the 2-to-l Label Cover problem with label set sizes 
3 & 6 is MP -hard. 

By duplicating labels, this result also holds for label set sizes 3k & 6k for any k 6 N + . To 
the best of our knowledge, no explicit NP-hardness for this problem has previously been stated in 
the literature. Combining the constraint-variable reduction with the above-mentioned 3-Coloring 
hardness of [GS09] gives an NP-hardness of (1, || +e) for the problem with label sizes 3 & 6, which 
we believe to be the best previously known. It is not known how to take advantage of larger label 
set sizes. On the other hand, for label set sizes 2 Sz 4 it is known that satisfying 2-to-l Label Cover 
instances can be found in polynomial time. 

Regarding the hardness of the 2-to-l Label Cover problem, the only evidence we have is a 
family of integrality gaps for the canonical SDP relaxation of the problem, in [GKO + 10]. Re- 
garding algorithms for the problem, an important recent line of work beginning in [ABS10] (see 
also [BRS11, GS11, StelO]) has sought subexponential-time algorithms for Unique Label Cover 
and related problems. In particular, Steurer [StelO] has shown that for any constant /3 > and 
label set size, there is an exp(0(n^))-time algorithm which, given a satisfiable 2-to-l Label Cover 
instance, finds an assignment satisfying an exp(— 0(l//3 2 ))-fraction of the constraints. E.g., there is 
a 2°( n ' •'-time algorithm which (1, so)-approximates 2-to-l Label Cover, where sq > is a certain 
universal constant. 

In light of this, it is interesting not only to seek NP-hardness results for certain approximation 
thresholds, but to additionally seek evidence that nearly full exponential time is required for these 
thresholds. This can done by assuming the Exponential Time Hypothesis (ETH) [IP01] and by 
reducing from the Moshkovitz-Raz Theorem [MR10], which shows a near-linear size reduction 
from 3Sat to the standard Label Cover problem with subconstant soundness. In this work, we 
show reductions from 3Sat to the problem of (1, s + e)-approximating several CSPs, for certain 
values of s and for all e > 0. In fact, though we omit it in our theorem statements, it can be 
checked that all of the reductions in this paper are quasilinear in size for e = e{n) = O ( (logiogn)/ 3 ) ' 
for some j3 > 0. 

1.1 Our techniques 

Let us describe the high-level idea behind our result. The folklore constraint-variable reduction 
from 3-Coloring to 2-to-l Label Cover would work just as well if we started from "3-Coloring with 
literals" instead. By this we mean the CSP with domain Z3 and constraints of the form u v i — Vj 7^ c 
(mod 3)". Starting from this CSP — which we call 2NLin(Zj3) — has two benefits: first, it is at 
least as hard as 3-Coloring and hence could yield a stronger hardness result; second, it is a bit more 
"symmetrical" for the purposes of designing reductions. Finally, having proven a hardness result 
for 2NLin, it seems reasonable that it can be modified into a hardness result for 3-Coloring. We 
obtain the following hardness result for 2NLin(Zj3). 

Theorem 1.4. For all e > 0, it is NP-hard to (1, ^ + e)-decide the 2NLin problem. 

As 3-Coloring is a special case of 2NLin(Z3), [GS09] also shows that (1, §§ + e)-deciding 2NLin 
is NP-hard for all e > 0, and to our knowledge this was previously the only hardness known for 
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2NLin(Zj3). Further, the 0.836-approximation algorithm for 3-Coloring from above achieves the same 
approximation ratio for 2NLin(Zj3), and this is the best known [GW04]. To prove Theorem 1.4, 
we proceed by designing an appropriate "function-in-the-middle" dictator test, as in the recent 
framework of [OW12]. Although the [OW12] framework gives a direct translation of certain types 
of function-in-the-middle tests into hardness results, we cannot employ it in a black-box fashion. 
Among other reasons, [OW12] assumes that the test has "built-in noise", but we cannot afford this 
as we need our test to have perfect completeness. 

Thus, we need a different proof to derive a hardness result from this function-in-the-middle test. 
We first were able to accomplish this by an analysis similar to the Fourier-based proof of 2Lin(Zj2) 
hardness given in Appendix F of [OW12]. Just as that proof "reveals" that the function-in-the- 
middle 2Lin(Zj2) test can be equivalently thought of as Hastad's 3Lin(Zj2) test composed with the 
3Lin(Z 2 )-to-2Lin(Z 2 ) gadget of [TSSWOO], our proof for the 2NLin(Z 3 ) function-in-the-middle test 
revealed it to be the composition of a function test for a certain four-variable CSP with a gadget. 
We have called the particular four-variable CSP 4-Not-AII-There, or 4NAT for short. Because it 
is a 4-CSP, we are able to prove the following NP-hardness of approximation result for it using a 
classic, Hastad-style Fourier-analytic proof. 

Theorem 1.5. For all e > 0, it is NP-hard to (1, | + e)-decide the 4N AT problem. 

Thus, the final form in which we present our Theorem 1.3 is as a reduction from Label-Cover 
to 4NAT using a function test (yielding Theorem 1.5), followed by a 4NAT-to-2NLin(Zj3) gadget 
(yielding Theorem 1.4), followed by the constraint- variable reduction to 2-to-l Label Cover. Indeed, 
all of the technology needed to carry out this proof was in place for over a decade, but without 
the function-in-the-middle framework of [OW12] it seems that pinpointing the 4NAT predicate as 
a good starting point would have been unlikely. 

Our proof of Theorem 1.1 is similar: we design a function-in-the-middle test for 3-Coloring which 
uses the 2NLin(Zj3) function test as a subroutine. And though we do not find a gadget reduction 
from 3-Coloring to 4NAT, we are able to express the success probability of the test in terms of the 
4NAT test. Thus, there is significant overlap in the proofs of our two main theorems, and we are 
able to carry out the proofs simultaneously. 

1.2 Organization 

We leave to Section 2 most of the definitions, including those of the CSPs we use. The heart of the 
paper is in Section 3, where we give the 2NLin(Zj3), 3-Coloring, and 4NAT function tests and explain 
how they are related. Section 4 contains the Fourier analysis of the tests. The actual hardness proof 
for 4NAT is presented in Section 5, and it follows mostly the techniques put in place by Hastad 
in [HasOl]. Because the hardness proof for 3-Coloring is almost identical, we omit it. Appendix A 
contains a technical lemma. 

2 Preliminaries 

We primarily work with strings x £ 7L^ for some integer K. We write Xi to denote the ith coordinate 
of x. 

2.1 Definitions of problems 

An instance X of a constraint satisfaction problem (CSP) is a set of variables V, a set of labels D, and 
a weighted list of constraints on these variables. We assume that the weights of the constraints are 
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nonegative and sum to 1. The weights therefore induce a probability distribution on the constraints. 
Given an assignment to the variables / : V — > D, the value of / is the probability that / satisfies a 
constraint drawn from this probability distribution. The optimum of I is the highest value of any 
assignment. We say that an X is s-satisfiable if its optimum is at least s. If it is 1-satisfiable we 
simply call it satisfiable. 

We define a CSP V to be a set of CSP instances. Typically, these instances will have similar 
constraints. We will study the problem of (c, s)- deciding V. This is the problem of determining 
whether an instance of V is at least c-satisfiable or less than s-satisfiable. Related is the problem 
of (c, s) -approximating V, in which one is given a c-satisfiable instance of V and asked to find 
an assignment of value at least s. It is easy to see that (c, s)-deciding V is at least as easy as 
(c, s)-approximating V . Thus, as all our hardness results are for (c, s)-deciding CSPs, we also prove 
hardness for (c, s)-approximating these CSPs. 

We now state the four CSPs that are the focus of our paper. 

3- coloring: In this CSP the label set is Z3 and the constraints are of the form V{ 7^ Vj. 

2-NLin(Zj3): In this CSP the label set is Z3 and the constraints are of the form 

Vi — Vj ^ a (mod 3), a £ Z3. 

The special case when each RHS is is the 3-Coloring problem. We often drop the (Z3) from this 
notation and simply write 2NLin. The reader may think of the 'N' in 2 N Li 11(^3) as standing for 
'N'on-linear, although we prefer to think of it as standing for 'N'early-linear. The reason is that 
when generalizing to moduli q > 3, the techniques in this paper generalize to constraints of the 
form u Vi — Vj (mod q) G {a, a + 1}" rather than "vi — Vj 7^ a (mod q)" . For the ternary version 
of this constraint, — Vj + v^ (mod q) G {a, a + 1}", it is folklore 1 that a simple modification of 
Hastad's work [HasOl] yields NP-hardness of (1, |)-approximation. 

4- Not-AII-There: For the 4-Not-AII-There problem, denoted 4NAT, we define 4NAT : %\ -)• {0, 1} 

to have output 1 if and only if at least one of the elements of Z3 is not present among the four 
inputs. The 4NAT CSP has label set D = Z3 and constraints of the form 4NAT(?;i + k\, V2 + k2, V3 + 
/c3,t>4 + ki) = 1, where the k^s are constants in Z3. 

We additionally define the "Two Pairs" predicate Two Pair : 7L\ — > {0,1}, which has output 1 
if and only if its input contains two distinct elements of Z3, each appearing twice. Note that an 
input which satisfies TwoPair also satisfies 4NAT. 

d-to-1 Label Cover: An instance of the d-to-1 Label Cover problem is a bipartite graph G = 
(U U V, E), a label set size K, and a ci-to-1 map 7r e : [dK] — > [K] for each edge e £ E. The elements 
of U are labeled from the set [K], and the elements of V are labeled from the set [dK]. A labeling 
f : U UV [dK] satisfies an edge e = (u, v) if 7T e (f(v)) = f(u). Of particular interest is the d = 2 
case, i.e., 2-to-l Label Cover. 

Label Cover serves as the starting point for most NP-hardness of approximation results. We 
use the following theorem of Moshkovitz and Raz: 

Theorem 2.1 ([MR10]). For any e = e(n) > n~ o(1) there exists K,d < 2 poly ( 1 / e ) such that the 
problem of deciding a 3Sat instance of size n can be Karp-reduced in poly(n) time to the problem 
of (1, e)- deciding d-to-1 Label Cover instance of size n l+ °^ with label set size K. 

1 Venkatesan Guruswami, Subhash Khot personal communications. 
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2.2 Gadgets 



A typical way of relating two separate CSPs is by constructing a gadget reduction which translates 
from one to the other. A gadget reduction from CSPi to CSP2 is one which maps any CSPi 
constraint into a weighted set of CSP2 constraints. The CSP2 constraints are over the same set of 
variables as the CSPi constraint, plus some new, auxiliary variables (these auxiliary variables are 
not shared between constraints of CSPi). We require that for every assignment which satisfies the 
CSPi constraint, there is a way to label the auxiliary variables to fully satisfy the CSP2 constraints. 
Furthermore, there is some parameter < 7 < 1 such that for every assignment which does not 
satisfy the CSPi constraint, the optimum labeling to the auxiliary variables will satisfy exactly 7 
fraction of the CSP2 constraints. Such a gadget reduction we call a ^-gadget-reduction from CSPi 
to CSP2- The following proposition is well-known: 

Proposition 2.2. Suppose it is NP-hard to (c, s)-decide CSPi. If there exists a 7- gadget-reduction 
from CSPi to CSP2, then it is NP-hard to (c + (1 — 0)7, s + (1 — s)^f)-decide CSP2. 

We note that the notation 7-gadget-reduction is similar to a piece of notation employed by 
[TSSWOO], but the two have different (though related) definitions. 

2.3 Fourier analysis on Z 3 

Let u) = e 27 ™/ 3 and set U3 = {uj° , u 1 , u> 2 } . For a G Z3, consider the Fourier character Xa '■ ^3 — > U3 
defined as Xa(x) = uj a ' x . Then it is easy to see that E[x a (x)xp(x)] = l[a = 0\, where here and 
throughout x has the uniform probability distribution on Z3 unless otherwise specified.. As a 
result, the Fourier characters form an orthonormal basis for the set of functions / : Z3 — > U3 under 
the inner product (/, g) = E,[f(x)g(x)]; i.e., 

f = ^2 

where the /(a)'s are complex numbers defined as f{a) = ~E[f(x)xa(x)}. For a G Z3, we use the 
notation \a\ to denote ^ oti and #a to denote the number of nonzero coordinates in a. When d is 
clear from context and a G ^f K , define -^3(0) G so that (^3(0))^ = \a[i]\ (mod 3) (recall the 
notation a[i] from the beginning of this section). We have Parseval's identity: for every / : Z3 — > U3 
it holds that Ylaew,% \f( a )\ 2 = 1- Note that this implies that |/(a)| < 1 for all a, as otherwise 

|/(a)| 2 would be greater than 1. 

A function / : Z3 — > Z3 is said to be folded if for every x G Z3 and c G Z3, it holds that 
f(x + c) = f(x) + c, where (x + c)j = Xj + c. 

Proposition 2.3. Let f : -)• U 3 be folded. Then f(a) ^ \a\ = 1 (mod 3). 
Proof. 

/(a) = E[/(a; + l) Xa (x + l)} = E[uf(x) Xa (x)x Q (l,l,...,l)] = wXa(l, 1, l)/(a). 
This means that wx a (l, 1, . . . , 1) must be 1. Expanding this quantity, 

So, \a\ = 1 (mod 3), as promised. □ 



5 



2.4 Dictatorship tests 

In this paper, we make use dictatorship tests, which are a standard tool for proving NP-hardness 
of approximation results. Generally speaking, the input of a dictatorship test is two functions 
/ : %^ — > Z3 and g : Tj^ k — > Z3 and a d-to-1 map tt : [dK] — > [K]. The map it naturally groups 
strings y £ 7L^ K into K separate "blocks" of coordinates, the first block being the coordinates in 
7r _1 (l), the second block being the coordinates in 7r _1 (2), etc. Without loss of generality we will 
assume that ir is the map where ir(k) = 1 for 1 < k < d, n(k) = 2 for d + 1 < k < 2d, and so on. In 
this case, we write y[i] £ 7L\ for the ith block of y, and £ for the jth. coordinate of this 

block. 

The goal of a dictatorship test is to distinguish the case when / and g are "matching dictators" 
from the case when / and g are "far from matching dictators". A function / is a dictator if 
f{x) = Xi, for some i. Furthermore, / and g are matching dictators if f{x) = Xi, g(y) = yj, and 
7r(j) = i. In other words, they are dictators whose dictator coordinates match up according to the 
map 7r. A property of matching dictators is that both / and g "depend on" certain coordinates, 
meaning that these coordinates are important to the output of / and g, and these coordinates 
match each other. Thus, / and g are far from matching dictators if there are no coordinates i and 
j which / and g depend on, respectively, for which = i. An example of this is "nonmatching" 
dictators, when, say, f(x) = x\ and g(y) = y ( i+i- 

To prove hardness for 3-Coloring (i.e., the 7^ constraint), one should construct a dictatorship 
test with the following outline: first, the test picks x £ Z3" and y £ Ti^ K from some random 
distribution, and checks whether f(x) ^ g(y). If indeed this is the case, then the test passes, and 
otherwise it fails. Generally, if one is interested in showing that (c, s)-deciding a given problem is 
NP-hard, it suffices to construct a test for which matching dictators pass with probability at least 
c and functions far from matching dictators pass with probability less than s. 

We use a variant of this outline proposed in [OW12], in which the test involves a third auxiliary 
function h : S — > Z3, where S is some set. We still want to distinguish the cases of / and g 
being matching dictators and functions far from matching dictators, but now the outline is a little 
different: in addition to selecting x and y, we also select from some distribution a string z £ S. 
Then with some probability we test h{z) 7^ /(a?) and with some probability h{z) ^ g{y)- A test 
with this outline we refer to as a "function- in-the- middle" test, as h acts as an intermediary between 
the functions / and g. 

3 3-Coloring and 2NLin tests 

In this section, we give our hardness results for 3-Coloring and 2-to-l Label Cover, following the 
proof outlines described at the end of Section 1.1. First, we state a pair of simple gadget reductions: 

Lemma 3.1. There is a 3 / '4- gadget-reduction from 4NAT to 2NLin. 

Lemma 3.2. There is a \j 2- gadget-reduction from 2NLin to 2-to-l Label Cover. 

Together with Proposition 2.2, these imply the following corollary: 

Corollary 3.3. There is a 7 / 8- gadget-reduction from 4NAT to 2-to-l Label Cover. Thus, if it is 
NP-hard to (c, s)-decide the 4N AT problem, then it is NP-hard to ((7 + c)/8, (7 + s)/8)-decide the 
2-to-l Label Cover problem. 

The gadget reduction from 4NAT to 2NLin relies on the simple fact that if a, b,c,d€ Z3 satisfy the 
4NAT predicate, then there is some element of Z3 that none of them equal. 
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Proof of Lemma 3.1. A 4 NAT constraint C on the variables S = (v i, v%, v%, V4) is of the form 

4NAT(wi + ki,v 2 + k 2 ,v 3 + k 3 ,V4 + k 4 ), 

where the k^s are all constants in Z3. To create the 2NLin instance, introduce the auxiliary variable 
yc and add the four 2NLin equations 

Vi + h^yc (mod 3), i € [4]. (1) 

If / : S — > Z3 is an assignment which satisfies the 4NAT constraint, then there is some a € Z3 
such that f{vi) + ki 7^ a (mod 3) for all i £ [4]. Assigning a to yc satisfies all four equations (1). 
On the other hand, if / doesn't satisfy the 4NAT constraint, then {/(i>i) + ^i}ie[4] = ^3> so 110 
assignment to yc satisfies all four equations. However, it is easy to see that there is an assignment 
which satisfies three of the equations. This gives a |-gadget-reduction from 4NAT to 2NLin, which 
proves the lemma. □ 

The reduction from 2NLin to 2-to-l Label Cover is the well-known constraint- variable reduction, 
and uses the fact that in the equation Vi — v j 7^ a (mod 3) , for any assignment to Vj there are two 
valid assignments to Vi, and vice versa. 

Proof of Lemma 3.2. A 2NLin constraint C on the variables S = (^1,^2) is of the form 

v\ — V2 7^ a (mod 3), 

for some a G Z3. To create the 2-to-l Label Cover instance, introduce the variable yc which will 
be labeled by one of the six possible functions g : S — > Z3 which satisfies C. Finally, introduce the 
2-to-l constraints yc{v\) = v\ and yc{v2) = ^2- Here v% and V2 are treated on the left as inputs to 
the function labeling yc and on the right as variables to be labeled with values in Z3. 

If / : S — > Z3 is an assignment which satisfies the 2NLin constraint, then we label yc with /. 
In this case, 

yc{vi) = f(vi), 2 = 1,2. 

Thus, both equations are satisfied. On the other hand, if / does not satisfy the 2NLin constraint, 
then any g which yc is labeled with disagrees with / on at least one of v\ or V2- It is easy to see, 
though, that a g can be selected to satisfy one of the two equations. This gives a ^-gadget-reduction 
from 2NLin to 2-to-l, which proves the lemma. □ 

3.1 Three tests 

Now that we have shown that 2NLin hardness results translate into 2-to-l Label Cover hardness 
results, we present our 2NLin function test. From here, the 3-Coloring function test follows imme- 
diately. Finally, we will show how in the course of analyzing the 2NLin test one is lead naturally to 
our 4NAT test. This correspondence between the 2NLin test and the 4NAT test parallels the gadget 
reduction from Lemma 3.1. The test is: 



2NLin Test 



Given folded functions / : 7L% Z 3j 9, h : 7L% K -> Z 3 : 
• Let x 6 7L^ and y 6 7L^ K be independent and uniformly random. 
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For each i £ [K],j G [d], select independently and uniformly from the elements of 

S 3 \ {an, (yWj}. 

With probability 4, test /(a?) / h(z); with probability |, test g(y) ^ h(z). 
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Figure 1: An illustration of the 2NLin test distribution; d = 3, K = 5 

Above is an illustration of the test. We remark that for any given block i, z[i] determines Xi 
(with very high probability), because as soon as z[i] contains two distinct elements of Z3, Xj must 
be the third element of Z3. Notice also that in every column of indices, the input to h always 
differs from the inputs to both / and g. Thus, "matching dictator" assignments pass the test with 
probability 1. (This is the case in which f(x) = x% and g(y) = (y[i])j for some i £ [K], j £ [d].) On 
the other hand, if / and g are "nonmatching dictators" , then they succeed with only ^ probability. 
This turns out to be essentially optimal among functions / and g without "matching influential 
coordinates/blocks". We will obtain the following theorem: 

Theorem 1.4 restated. For all e > 0, it is NP-hard to (1, ^ + e)-decide the 2NLin problem. 

We would like to use a similar test for our 3-Coloring hardness result, but we can no longer 
assume that the functions /, g, and h are folded. This is problematic, as without this guarantee / 
and g could both be identically and h could be identically 1, in which case the three functions 
would pass the test with probability 1. Since constant functions cannot be decoded to Label Cover 
solutions, we would like to prevent this from happening. Thus, we will add "folding tests" to force 
/ and g to look folded. Having ensured this, we are free to run the 2NLin test without worry. The 
test is: 



3-Coloring Test 



Given functions / : Z3 — > S3, g, h : % 

7 dK 



dK 
3 



• Let x G %^ and y 6 Zg K be independent and uniformly random. 

• With probability tL test f(x) 7^ f(x + 1); with probability A, test g(y) 7^ g(y + 1). 



With the remaining j| probability, run the "non-folded" version of the 2NLin test on /, g, 
and h. 



Here, by the "non-folded" version of the 2NLin test, we mean the test which is identical to the 
2NLin test, only it does not assume /, g, and h are folded. If / and g are matching dictators, then 
they always pass the folding tests, so as before they succeed with probability 1. If on the other hand 
/ and g are nonmatching dictators, then they also always pass the folding tests, so they succeed 
with probability ^ + • |f = jf • Just as before, this turns out to be basically optimal among 
functions without matching influential coordinates: 
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Theorem 1.1 restated. For all e > 0, it is HP-hard to (1, ^ + e)-decide the 3-Coloring problem. 

Let us further discuss the 2NLin test. Given x, y, and z from the 2NLin test, consider the fol- 
lowing method of generating two additional strings y',y" G ^ K which represent h's "uncertainty" 
about y. For j £ [d], if Xi = (y[i])j, then set both (y'[i])j and (y"[i])j to the lone element of 
Z3 \ {xi, (z[i])j}. Otherwise, set one of (y'[i])j or (y"[i])j to Xi, and the other one to (y[i])j. It 
can be checked that TwoPair(a?i, (y[i])j, (y'[i])j, (y"[i])j) = 1, a more stringent requirement than 
satisfying 4NAT. In fact, the marginal distribution on these four variables is a uniformly random 
assignment that satisfies the TwoPair predicate. Conditioned on x and z, the distribution on y' and 
y" is identical to the distribution on y. To see this, first note that by construction, neither (y'[i])j 
nor (y"[i])j ever equals (;z[i])j. Further, because these indices are distributed as uniformly random 
satisfying assignments to TwoPair, Pr[(y'[i])j = xi\ = Pv[{y"[i])j = Xi] = |, which matches the 
corresponding probability for y. Thus, as y, y' , and y" are distributed identically, we may rewrite 
the test's success probability as: 

Pr[/, g, and h pass the 2NLin test] = \ Pr[/(aj) / h(z)} + § Pr[g(jj) / h(z)] 

Pr[f(x)?h(z)]. 



avg < 



Pr[g(y) / h(z)], 
Pr[g(y>) ^ h(z)}, 
Pr[g(y") + h(z)} 



< 3 4 + 1 i B[4NAJ(f(x),g(y),g(y'),g(y"))}. (2) 

This is because if 4NAT fails to hold on the tuple (f(x),g(y),g(y'),g(y")), then h(z) can disagree 
with at most 3 of them. 

At this point, we have removed h from the test analysis and have uncovered what appears to be 
a hidden 4NAT test inside the 2NLin test: simply generate four strings x, y, y' , and y" as described 
earlier, and test 4NAT(/(a;), g(y), g(y'), g(y")). With some renaming of variables, this is exactly 
what our 4 NAT test does: 



4 NAT Test 



Given folded functions / : 7L^ 



Z 3 , g : %i K 



Let x G 7L^ be uniformly random. 

Select y,z,w as follows: for each i £ [K],j £ [d], select ((y[i])j, (z[i])j, (w[i])j) uniformly at 
random from the elements of Z3 satisfying TwoPair(a;j, (y[i])j, (z[i])j, (w[i])j). 

Test WAJ(f(x),g(y),g(z),g(w)). 



x 



UD 



LXI 



LXI 



y 



11210 



21111 



01211 



01210 



T~ro~ 



* 1 1 2 1 2 1 1 2 1 2 1 1 1 1 1 1 2 1 01210 1 1 1 1 2 



w I 1 I I 2 I I 1 I 2 I I I 2 I 1 I 1 I 1 I 1 I 1 I I 2 I 



Figure 2: An illustration of the 4 NAT test distribution; d = 3, K = 5 
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Above is an illustration of this test. In this illustration, the strings z and w were derived 
from the strings in Figure 1 using the process detailed above for generating y' and y" . Note that 
each column is missing one of the elements of and that each column satisfies the TwoPair 
predicate. Because satisfying TwoPair implies satisfying 4NAT, matching dictators pass this test 
with probability 1. On the other hand, it can be seen that nonmatching dictators pass the test with 
probability |. This is basically optimal among functions with no matching influential coordinates: 

Theorem 1.5 restated. For all e > 0, it is HP-hard to (1, | + e)-decide the 4N AT problem. 

Unfortunately, it is not clear if there is a similar gadget reducing 3-Coloring to 4NAT, or to 
any other simple 4CSP for that matter. However, by using Equation (2), we can still reduce the 
analysis of the 3-Coloring test to analyzing the 4NAT test: 

1 4 

Pr[/, g, and h pass the 3-Coloring test] <— • pf + — • p g 

+ P7 " (l + I nWAT(f(x),g(y),g(z),g(w))}^ . (3) 

Here pj and p g are the probabilities that / and g pass the folding test, respectively, and x, y, 
z, and w are distributed as in the 4NAT test. This equation will be the focus of our 3-Coloring 
soundness proof. 

(As one additional remark, our 2NLin test is basically the composition of the 4NAT test with 
the gadget from Lemma 3.1. In this test, if we instead performed the f(x) ^ h(z) test with 
probability g and the g(y) 7^ h(z) test with probability |, then the resulting test would basically 
be the composition of a 3NLin test with a suitable 3NLin-to-2NLin gadget.) 



4 Fourier analysis 

Let u) = e 2?n / 3 , and set C/3 = {oj°, oj 1 , w 2 }. In what follows, we identify / and g with the functions 
ojf and uj 9 , respectively, whose range is C/3 rather than Z3. Set L = dK. Define 

Dec(/, 5 ):= Y, l/>3(«))|-|^(a)| 2 -(l/2) #Q . 

This quantity corresponds to the "decodable" part of / and g. This section is devoted to proving 
the following two lemmas: 

Lemma 4.1. Let f : 7L^ — > U3 and g : %\ — > C/3 be folded. Then the probability f and g pass the 
4NAT test is at most | + ^Dec(f,g). 

Lemma 4.2. Let f : TL^ — > U3 and g : Z3 — > C/3. Then the probability f and g pass the 3-Coloring 

test is at most || + ^Dec(/, g). 

Having proven these, our hardness results follow immediately from a standard application of 
Hastad's method. See Section 5 for details. 

The first step is to "arithmetize" the 4NAT predicate. It is not hard to verify that 

4NAT(ai, a 2 ,a 3 ,a 4 ) = \ + J^X^ ~ I Yl ^'^^ 

i^j i<j<k i<j<k 

= l + Iz)^ 015 ^] " I Yl kk^w *]. 

i<j i<j<k 
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Using the symmetry between y, z, and w, we deduce 



5 + MB[f(x)g(y)] + MB[g(y)g(z)] - ME[f(x)g(y)g(z)} - MB[g(y)g(z)g(w)}. (4) 



E[4N AT (f (x) , g(y) , g(z) , g(w))] 

_5 i 2 
9 " r 3 

To analyze this expression, we will need the following lemma: 

Lemma 4.3. Let a £ A 7 £ ^jPS an< ^ * and j 6e sitc/i £/ia£ = i. Then 



E 



otherwise. 



Proof. Conditioned on ajj = a, the distribution on the values for (yj,Zj) is uniform on the six 
possibilities (a, a + 1), (a, a + 2), (a + 1, a), (a + 1, a + 1), (a + 2, a), and (a + 2, a + 2). If (3j = ~fj, 
then the expectation equals E[u)^ j ^ y ^ +Z ^ | X{ = a]. As either yj + Zj = 2a + 1 (mod 3) or 
y, + Zj = 2a + 2 (mod 3), each with probability half, this is equal to 



■■) 

On the other hand, If f3j ^ 7^, then either only one of f3j or jj is zero, or neither is zero, and 
7j = — /3j (mod 3). In the first case, the expectation is either E[w^ y J | xi = a] or E[u; 7 -? Zj ' | Xi = a] 
for a nonzero /3j or a nonzero 77, respectively. Both of these expectations are zero, as both yj and 
Zj are uniform on Z3. In the second case, 

E[t/^ +7 ^ I Si = a] =E[w^ w i-^ z J I asj = a] 
= E[a;^ ^- z ^ I SBi = a], 

which is zero, because /3j is nonzero and y^ — Zj is uniformly distributed on S3. □ 

Now we use this to find an expression for a general form of the E[f(x)g(y)g(z)] term: 
Lemma 4.4. Let ft : ->• M and 31,52 : ^3 -> ™en 



E I/i( a; )5i(y)fi'2(2)] = ^ A( 7r 3(a))3i(")32(a) (-r 



From this, we can derive the following two corollaries: 
Corollary 4.5. Let g : -> E. TTien 

Efo(y)^)] = 2 g(c*)g(-a) (-^) # . 

a:\a[i]\=0 Vi ^ ' 

Proof (assuming Lemma 4-4)- Set /1 = 1, 31 = g, and 32 = ff- The only nonzero Fourier coefficient 
of /1 is /i(0) = 1, and the only elements a E 7L\ for which 7:3(0) = are those where \a[i]\ = for 
all i. Apply Lemma 4.4 to these three functions: 



E[g(y)g(z)]= £ gi(a)g 2 (a) # 

a:|a[i]|=0 Vi 
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Since gi(a) = g(a), it remains to show that 52(a) = g(—a), and this is true because 



52(a) = E[g 2 (y)xa(y)} = E[g(y) Xa (y)} = E[g(y) X - a (y)} = B[g(y) X - a (y)} = g{-a) 



where the third equality follows from xp( z ) = ljJ ^' z = w = x~p( z )- 

Corollary 4.6. -UE[f(x)g(y)g(z)} < Dec(f,g) + |/(0)| £ Q:7r3(a)=0 \d(»)\ 2 ■ (l/2)# c 

Proof (assuming Lemma 4-4)- Applying Lemma 4.4 to /, g, and g: 



-KE\f(x)g(v)g(z)] = -3l £ f(ir 3 (a))g(a) { o 

< £ |/(vr 3 (a))|-|5(a)| 2 -(l/2) #c 



Dec(/, 5 )+ ^ |/(0)| -|5(a)| 2 -(l/2)# c 

a:7T3(o)=0 



We now prove Lemma 4.4. 
Proof of Lemma 4-4- Begin by expanding out E[fi(x)gi(y)g2(z)]: 



Wi( x )9i(y)92{z)} = fi( a )9i(P)g2h)~E[Xa(x)xf3{y)x 1 (z)}- 



We focus on the products of the Fourier characters: 



^[Xa{x)xp{y)Xi{z)\ = n E[x ai (xi)xi3[i](y[i])x^](z[i})} 

ie[K] 



We can attend to each block separately: 

EtXai(a5i)X|9[*l(y[*])X7[.-|(«[*])] = E 

= E 



ai-wi+/3[i] y [i] +j[i]-z[i] 



cti-a 



n e 



Xj = a 



(*) 



□ 



□ 



(5) 



(6) 



(7) 



Lemma 4.3 tells us that the expectation (*) is zero if f3j 7^ jj. Thus, if Equation (6) is to be 
nonzero, it must be the case that ft = 7. If this is the case, then we can rewrite Equation (7) as 



(7) = E 



j:ir(j)=i 



4) 



E 

X 



1 \ #m 

2 



Note that the exponent of w, (cq + 2|/3[i]|)a, is zero if a» = (mod 3), in which case the 

expectation is just the constant (— l/2)* /3 W. This occurs for all i S [K] exactly when a = 7T3(/3). 
If, on the other hand, on + 2\j3[i]\ is nonzero, then the entire expectation is zero because a, the 
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value of Xi, is uniformly random from T13. Thus, Equation (6) is nonzero only when a = ir^{pi) and 
P = 7, in which case it equals 




We may therefore conclude with 

(5)= A(^3(a))5i(«)52(«)(-^) # • □ 

4.1 4NAT Analysis 

In the 4NAT test, we may assume that / and g are folded, which immediately implies that 
E[f(x)g(y)] = 0. This is because x and y are independent, and hence 

E[f(x)g^)) = E[f(x)] E[g&)) =0-0 

since / and g are folded. Next, folding also implies that E[g(y)g(z)} = 0. To see this, first note 
that for any a for which \ct[i]\ = for all i, we have that \a\ = 0. Thus, any such a must satisfy 
g(a) = 0, as Proposition 2.3 implies that g(a') 7^ only when \a'\ = 1. This means the sum in 
Corollary 4.5 must be zero, which implies that ~E[g(y)g(z)] = as well. 
Equation (4) has now been reduced to 

(4) = § - lRE[f(x)g(y)g(z)] - l^E[g{y)g{z)g{w)\. (8) 

As g(y)g(z)g(w) is always in C/3, K E[g(y)g(z)g(w)] is always at least — |. Therefore, 

(8) < § - l^E[f(x)g(y)g(z)} = ? + ^Dec(/, 5 ), (9) 

using Corollary 4.6 and the fact that /(0) = by folding. This proves Lemma 4.1. 



4.2 3-Coloring Analysis 

The analysis of the 3-Coloring test is more involved, partially because we can no longer assume 
either of the functions are folded, and partially because we need a more careful analysis of the 
E[g(y)g(z)g(w)] term. Instead, we upper-bound these terms with expressions involving the empty 
coefficients /(0) and g(0), which, when large, cause the folding tests to fail with high probability. 
In addition, the analysis of the 3-Coloring test also involves analyzing the folding tests on / and g, 
and it is with these that we start. 

For a function : Z% -> U 3 , define EVEN(/i) = £ Q: , aNo |/i(«)| 2 . 

Lemma 4.7. Pr[f(x) / f(x + 1)] = 1 - EVEN(/). 



Proof. It is easy to see that Pr[f(x) ^ f(x + 1)] = f ( 1 -»E[/(i)/(s + !)])• Expanding the 



3 

expectation, 



E[f(x)f(x + l)]=J2 f(n)fW) E[ Xa (x) X p(x + 1)] 



a,/3 



J2f(a)f(P)E\ Xa (x)xp(x 
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Since 'E\x a ( x )xp{ x )] = M a = P]> this equals ^ Q |/(a)| 2 ti; l Q L Taking the real part, 



5ft^|/|2 a; -N =EVEN(/) _l J-, l/l 



" |a|^0 



EVEN(/) - i(l - EVEN(/)) = ^EVEN(/) - i 



Thus, the probability of passing the folding test is § (l - §EVEN(/) + |) = 1 - EVEN(/). □ 

Now we focus on the E[4NAT(- • • )] term. Let us upper-bound the terms in Equation (4) from 
left to right. First, 



Proposition 4.8. ME[f(x)g(y)] < H\f(0)\ 2 + |s(0)| 2 ). 



Proof. By the independence of x and y, E[f(x)g(y)] = E[/(a;)] • E[g(y)] = /(0)g(0). Then, 

Rf(O)P) < |/(0)| • |5(0)| < 2(l/(0)| 2 + |5(0)| 2 ), 
using the fact that a 2 + b 2 > 2ab for all real numbers a and b. □ 
Next, 



Lemma 4.9. ME[g(y)g(z)] < EVEN(#). 
Proof. From Corollary 4.5, 

ME[g(y)gJz)] = M £ g(a)gj^a) # " < £ \g(a)\ \g(-a)\ Q) # ° 

a:|a[i]|=0 Vi \ " / a:|a[»]|=0 Vi ^ ' 

By Cauchy-Schwarz, this is at most 



E i^)i 2 G) \ E i*(-«)i 2 UJ = E i^ 12 



\ ^ l " v " /l v 2 / \ ^ 

\ a:|a[i]|=0 Vi v ' \ a:|a[i]|=0 Vi v 7 a:|a[i]|=0 Vi 

which is clearly at most X^lalsO l5(°0l 2 = EVEN(g). □ 
Next, 

Lemma 4.10. -ME[f(x)g(y)g(z)] < |/(0)| ■ (f \g(0)\ 2 + |EVEN( 5 )) + Dec(/,g). 
Proof. By Corollary 4.6, 

-ME[f(x)g(y)g(z)} < Dec(/, 5 ) + |/(0)| £ |^H| 2 ■ (l/2) #a 

o:7T3(a)=0 

Consider the sum X) a:7r3 ( a )=o l5( a )| 2 ' (l/2)* a . The only time that #a = is when a = 0. In 
addition, no a with = 1 contributes to the sum, because such an a cannot satisfy ~K?,{a) = 
(one of its coordinates must be 1 or 2). Thus, the sum is upper-bounded by 

i5(o)i 2 + ^ E \a(*)? <\m? + \Y. i^(«)i 2 

Q:7T3(a)=0 |a|=0 

= |5(0)| 2 + ^(EVEN( 5 ) - = \\m\ 2 + ^EVEN( 5 ). 

This concludes the lemma. □ 
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The last term, F,[g(y)g(z)g(w)], is more difficult to bound. The bound we use is: 

Lemma 4.11. -ME[g(y)g(z)g(w)} < \ - §|s(0)| 2 . 

The proof of Lemma 4.11 is presented in Appendix A. 

Substituting Proposition 4.8 and Lemmas 4.9, 4.10, and 4.11 into Equation (4) and performing 
some arithmetic yields 

E[4NAT(- ■■)]<! + §Dec(/,p) + ^|/(0)| 2 + ^|/(0)||«?(0)| 2 + EVENfo) \l + . 

By plugging this bound into Equation 3, applying Lemma 4.7, and performing more arithmetic, we 
can upper bound the probability that / and g pass the 3-Coloring test by 

-lEVEN(/) - EVEN( 9 ) (l - MJ + _L|/ (0) |2 + J^|/(0)||$(0)| a + ^Dec(/, 5 ) + H. 

Note that because < 1/(0)1 < 1, the coefficient of EVEN [g) is always negative. Thus, we may 
bound — EVEN(/) and -EVEN( 5 ) by -|/(0)| 2 and — 1^(0) | 2 , respectively, resulting in a total upper 
bound of 

^ (l/(0)ll5(0)| 2 - \m\ 2 ) + ^Dec(/, 5 ) + 

The leftmost term is always at most zero, so this is at most ^Dec(/, g) + the expression claimed 
in Lemma 4.2. 



5 Hardness of 4NAT 

In this section, we show the following theorem: 

Theorem 1.5 (detailed). For all e > 0, it is UP-hard to (1, | + e)-decide the 4 NAT problem. In 
fact, in the "yes case", all 4NAT constraints can be satisfied by TwoPair assignments. 

Combining this with Lemma 3.1 yields Theorem 1.4, and combining this with Corollary 3.3 
yields Theorem 1.3. It is not clear whether this gives optimal hardness assuming perfect complete- 
ness. The 4NAT predicate is satisfied by a uniformly random input with probability |, and by the 
method of conditional expectation this gives a deterministic algorithm which (1, |)-approximates 
the 4NAT CSP. This leaves a gap of h in the soundness, and to our knowledge there are no better 
known algorithms. 

On the hardness side, consider a uniformly random satisfying assignment to the TwoPair predi- 
cate. It is easy to see that each of the four variables is assigned a uniformly random value from Z3, 
and also that the variables are pairwise independent. As any satisfying assignment to the TwoPair 
predicate also satisfies the 4NAT predicate, the work of Austrin and Mossel [AM09] immediately 
implies that (1 — e, | + e)-approximating the 4NAT problem is NP-hard under the Unique Games 
conjecture. Thus, if we are willing to sacrifice a small amount in the completeness, we can improve 
the soundness parameter in Theorem 1.5. Whether we can improve upon the soundness without 
sacrificing perfect completeness is open. 

We now arrive at the proof of Theorem 1.5. The proof is entirely standard, and proceeds by 
reduction from d-to-1 Label Cover. A nearly identical proof gives Theorem 1.1, which we omit. The 
proof makes use of our analysis of the 4NAT test, which is presented in Section 4. One preparatory 
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note: most of the proof concerns functions / : — > Z3 and g : Z3 — > Z3. However, we also 
be making use of Fourier analytic notions defined in Section 2.3, and this requires dealing with 
functions whose range is Us rather than Z3. Thus, we associate / and g with the functions ui-f 
and u 9 , and whenever Fourier analysis is used it will actually be with respect to the latter two 
functions. 

Proof. Let G = (U U V, E) be a d-to-1 Label Cover instance with alphabet size K and d-to-1 maps 
7r e : [dK] — > [K] for each edge e £ E. We construct a 4NAT instance by replacing each vertex 
in G with its Long Code and placing constraints on adjacent Long Codes corresponding to the 
tests made in the 4NAT test. Thus, each u G U is replaced by a copy of the hypercube Z3 and 
labeled by the function f u : — > Z3. Similarly, each v G V is replaced by a copy of the Boolean 
hypercube "Ef K and labeled by the function g v : ^ K — > Z3. Finally, for each edge {u, v} G E, a 
set of 4NAT constraints is placed between f u and g v corresponding to the constraints made in the 
4NAT test, and given a weight equal to the probability the constraint is tested in the 4NAT test 
multiplied by the weight of {u, v} in G. This produces a 4NAT instance whose weights sum to 1 
which is equivalent to the following test: 

• Pick an edge e = (it, v ) G E uniformly at random. 

• Reorder the indices of g v so that the kth group of d indices corresponds to TT~ 1 (k). 

• Run the 4 NAT test on f u and g v . Accept iff it does. 



Completeness If the original Label Cover instance is fully satisfiable, then there is a function 
F : U U V — > [dK] for which val(F) = 1. Set each f u to the dictator assignment f u (x) = ^f(u) 
and each g v to the dictator assignment g v (y) = Vf(v)- Let e = {u, v} G E. Because F satisfies the 
constraint 7r e , F{u) = 7r e (F(v)). Thus, f u and g v correspond to "matching dictator" assignments, 
and above we saw that matching dictators pass the 4NAT test with probability 1. As this applies 
to every edge in E, the 4NAT instance is fully satisfiable. 

Soundness Assume that there are functions {f u }ueu an d {g v }v&v which satisfy at least a | + e 
fraction of the 4NAT constraints. Then there is at least an e/2 fraction of the edges e = {u, v} G E 
for which f u and g v pass the 4NAT test with probability at least | +e/2. This is because otherwise 
the fraction of 4NAT constraint satisfied would be at most 

e\ /2 e\ t r . 2 2e e 2 2 
l -2) U + 2j + 2 (1) = 3 + 3-4 <3 +e - 

Let E f be the set of such edges, and consider {tx, v} G E\ Set L = dK. By Lemma 4.1, 



2 c 2 2 

3 + 2 - Pr ^« and 9v Pass the 4NAT test] < - + - I ^ \Ufa(<*)) \9v{a)\ 



i a:7T3(a)^0 



meaning that 

3e 



< |/t.M«))|l*(«)l 2 Q) - (io) 



a:7T3(a)^0 
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Parseval's equation tells us that Ylae%% l&> ( a ) I 2 = 1- The function g v therefore induces a probability 
distribution on the elements of . As a result, we can rewrite Equation (10) as 



3e 

— < E 

4 a~g v 



/w(vr 3 (a)) 



l[7T 3 (a)^0] 



(11) 



As previously noted, |/ u (7T3(a))| is less than 1 for all a, so the expression in this expectation as 
never greater than 1. We can thus conclude that 



3f 



< Pr 



fu(n 3 (a)) 



#a 



l[7r 3 (a)/0] > 



3e 



GOOD c 



as otherwise the expectation in Equation (11) would be less than 3e/4. Call the event in the 
probability GOOD a . When GOOD Q occurs, the following happens: 

• |/ u (vr 3 (a))| 2 > 96 2 /64. 

• #a < log 2 (8/3e). 

• 7r 3 (a) / 0. As a result, j^a > 0. 

This suggests the following randomized decoding procedure for each u £ U: pick an element 
f3 G Zg- with probability |/ u (/3)| 2 and choose one of its nonzero coordinates uniformly at random. 
Similarly, for each v G V, pick an element a G with probability |<7i,(a)| 2 and choose one of its 
nonzero coordinates uniformly at random. In both cases, nonzero coordinates are guaranteed to 
exist because all the / n 's and <7„'s are folded. 

Now we analyze how well this decoding scheme performs for the edges e = {u, v} G E' (we 
may assume the other edges are unsatisfied). Suppose that when the elements of TL^ and were 
randomly chosen, g„'s set a was in Good a , and / u 's set j3 equals 7r 3 (a). Then, as j^a < log 2 (8/3e), 
and each label in 7r 3 (a) has at least one label in a which maps to it, the probability that matching 
labels are drawn is at least 1/ log 2 (8/3e). Next, the probability that such an a and /3 are drawn is 



£ |/^ 3 (a))| 2 |^(«) 

aeGOOD 



> 



9c 



64 2 

aeGOOD 



9v{a) 



> 



9e 2 3e 27e 3 



64 



512 



Combining these, the probability that this edge is satisfied is at least 27e 3 /512 log 2 (8/3e). Thus, 
the decoding scheme satisfies at least 



27e d 



5121og 2 (8/3e) 



1^1 
\E\ 



> 



27e 4 



1024 log 2 (8/3e) 



fraction of the Label Cover edges in expectation. By the probabilistic method, an assignment to 
the Label Cover instance must therefore exist which satisfies at least this fraction of the edges. 

We now apply Theorem 2.1, setting the soundness value in that theorem equal to 0(e 5 ), which 
concludes the proof. □ 
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A Proof of E[g(y)g(z)g(w)} 

Lemma A.l (Lemma 4.11 restated). —$fE[g(y)g(z)g(w)] < \ — ^\g(Q)\ 2 . 

Given a, j3, 7 £ 7L\ , define the predicate i^{a, f3, 7) to be true whenever \a[i} \ + \/3[i] \ + \ \ = 
for all i. In addition, define the function <!>(•, •, •) as 

$(0,^ 7 )= n n 

i 6 [X]7r(j)=i V 7 

where p = a + /3, a = f3 + 7, and r = a + 7. We will begin by deriving the following expansion for 
the expectation: 

Lemma A. 2. Let gi, #2, <?3 : ZI3 — > ^3- TVien 

E[gi(y)fla(*)fls(w)] = 5i(a)g 2 (/?)53(7)^(a,/5,7)- 



Proof. Begin by expanding out Ef^d/)^^)^ («?)]: 

E [51 (y)#2( z )#3 («>)] = ^ 5i( a )52(/3)53(7)E[xQ(y)X/3(2)x 7 ( w )]- 

We focus on the products of the Fourier characters: 

E[x a (y)X/3(*)X7(™)] = II E [Xo[f|(»[«])X^fl(*[*])X7M( tl '[*])] 

We can attend to each block separately: 



(12) 



(13) 



E[xa[i](yW)x^[i](«W)x 7 w(«'W)] = E 



n e 

J- A 7/.Z.' 



i--n(j)=i- 



y,z,w 



uOtjVi+PjZi+im \ Xi = a 



(*) 



(14) 



To analyze the expectation (*), note that conditioned on Xi = a, the distribution on the values 
for (yj, Zj) is uniform on the six possibilities (a, a + 1, a + 1), (a + 1, a, a + 1), (a + 1, a + 1, a), 
(a, a + 2, a + 2), (a + 2, a, a + 2), (0 + 2, a + 2, a). Then the expectation (*) is equal to 

I ^ w «(«j+i8i+7j) + + w «j-hy + w 2(ay-hs) + w /%+7j + ^2(^+7^)^ _ 

Note that u) a i + ^ + w 2 ( a J+ft) = w « + w 2 Pj = 2 - 3#p,-. Thus, the previous equation is equal to 



I («j+ft'+7j) 

Substituting this into Equation (14) yields 



E[x«[i](!/W)X/3[i](^W)X7[i]( w ^])] = e 



UJ 



a {\ a m+m\+hm) 



n 



j:-rr(j)=i 
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which is zero unless \a[i] \ + \/3[i] \ + \ = 0. Thus, the only Fourier coefficients a, /3, and 7 which 
contribute to Equation (12) are those which satisfy ip(a, /3, 7). Furthermore, for any such a, (5, and 
7, this equation is just 

and so Equation (13) is equal to $(a,/3,7). This concludes the lemma. □ 

Define p = Pr[g(y) = 0], q = Pr[g(y) = u], and r = Pr[g(y) = u) 2 ]. For a 6 t/3, l a (y) is the 
indicator of the event g(y) = a. In the Fourier transform calculations that follow, we may write f 3 
for some function /. This is always shorthand for f(ct)f(P)f('y). Similarly, given three functions 
fl, f2, and / 3 , /1/2/3 is always shorthand for f\{(x)h{P)h{l)- 

We'll start by computing the value of |<?(0)| 2 . 

Proposition A. 3. | <?(0) | 2 = p 3 + q 3 + r 3 — 3 • pqr. 

Proof. Direct calculation shows that |(/(0)| 2 = p 2 +g 2 +r 2 — (pq+pr+rq). Then because p+g+r = 1, 
p 2 + q 2 + r 2 — (pq + pr + rq) = p 2 + q 2 + r 2 — (pq + pr + rq) ■ (p + q + r) 

9 9 9 9 9 9 9 9 9 

= p + q + r — p q — p r — q p — q r — r p — r q — 3 - pgr 

000 

= p + g +r — 3 • pgr, 

where the last step replaces p 2 — p 2 q — p 2 r with p 3 , using 1 — q — r = p, and performs similar 
replacements for q and r. □ 

Before proving Lemma 4.11, we'll need the following lemma. 

Lemma A.4. Z a eU, E[l«(a;)l«(*)l a (ti;)] = 3 • E[li(y) l u (z) l^(w)] + \g(6)\ 2 . 

Proof. The LHS is equal to 

K (i! + i'+i^)-*(a,/9,7) (15) 

j/)(a,/9,7) 

Because li + l u + l w 2 = 1, the Fourier coefficients of 1^2 may be rewritten as follows: 

i w2 = i[a = o]-ii-i w . 

When the corresponding term in Equation (15) equals 

i? + 1 + & = if + + (-ii - i J 3 

= ii(/3)i u (7) • (-ii(a) - i w (a)) + li(7)l w (a) ■ - 
+ ii(a)i w (^)-(-ii( 7 )-i w (7)) 

For the a = case, 

p 3 + g 3 + r 3 = 3 • pgr + p 3 + g 3 + r 3 — 3 • pgr = 3 • pgr + |g(0)| 2 , 
where the last step uses Proposition A. 3. Substituting these into Equation (15) yields 

(15) = |5(o)| 2 + s Y, (iiiu,i^ + i^iiL + iu,i^ii)-*(«,A7), 

which equal the RHS of the lemma. □ 
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We now proceed to the proof of Lemma 4.11 

Proof of Lemma 1^.11. Rewrite g in terms of its indicator functions, i.e. g = li + u • l u + uj 2 ■ l w 2. 
Then 

ME[g(y)g(z)g(w)]=$l E g 3 • (3, 7 ) 

= » ^ (ii + L + L 2 ) 3 -cD(a,A7) 

j/j(a„9,7) 

= K E E o6c-laibic-*(a,^7) 

ip(a,/3,"f) a,b,c&U3 



E E Wc-^ 2 iaifoic •*(<*> a 7) 

E E[l a (aj)l 6 (z)l c (ti;)] - ^ E E[l a (x)l 6 (z)l c («;)] 
E E[l a ( a )l t (z)l e («)]-Hl- E E[l a (aj)l 6 (z)l c (u;)]j 

,6c=l \ abc=l / 



3 
2 



E E[l a (x)l 6 (z)l c (u>)] - \, 



abc=l 

where the second-to-last step uses the fact that ^ a b cg[/ E[l a (a;)l;,(y)l c (z)] = 1. This concludes 
the proof, as 

E E[l B (aj)l 6 (z)l c (io)] = E E[lo(a;)lo(«)la(t»)] + 6.E[li(x) 

abc=l a&Us 

= |^(0) | 2 + 9 • E[li(a;)l w (z)l, i; 2(w)] (By Lemma A.4.) 

> l5(0)| 2 , 

using the fact that E[li(x)l w («)l w a(«>)] > 0. □ 
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